Load Dataset
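
A minimal sketch of the loading step, assuming the data is a CSV file read with pandas. The in-memory CSV text below is a made-up placeholder so the snippet runs on its own. Its column names mirror ones mentioned in this notebook, but the record names and values are invented and are not real measurements.

```python
import io

import pandas as pd

# Placeholder CSV text standing in for the real data file (all values are invented).
csv_text = """name,MDVP:Fo(Hz),MDVP:Flo(Hz),MDVP:Shimmer,status
rec_01,120.0,75.0,0.04,1
rec_02,122.5,114.0,0.06,1
rec_03,198.0,192.0,0.02,0
"""

# With a real file, pass its path to pd.read_csv instead of the StringIO buffer.
df = pd.read_csv(io.StringIO(csv_text))

# Quick sanity checks: number of rows/columns and the inferred column types.
print(df.shape)
print(df.dtypes)
```

Checking the shape and dtypes right after loading is a cheap way to catch a wrong delimiter or a numeric column that was parsed as text.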

Profile Report

Manual EDA

From the information above, we conclude that:

The dataset contains far more people with the disease than healthy people, so the classes are imbalanced.

In the histograms above, the readings that occur with the highest frequencies are associated with a higher chance of having the disease.

Most features are asymmetrically distributed (skewed), for example MDVP:Fo(Hz), MDVP:Flo(Hz), and MDVP:Shimmer.

Almost all features show a relatively high correlation with the target variable, status.
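
The conclusions above can also be checked numerically rather than read off plots. The following is a sketch on synthetic stand-in data, not the notebook's dataset: the column names `skewed_feature` and `related_feature` are hypothetical, and the 150/50 class split is chosen only to imitate an imbalanced target.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)

# Synthetic stand-in: 150 rows with status=1 (disease) and 50 with status=0 (healthy).
status = np.array([1] * 150 + [0] * 50)
df = pd.DataFrame({
    "skewed_feature": rng.exponential(scale=1.0, size=200),  # right-skewed by construction
    "related_feature": status * 2.0 + rng.normal(size=200),  # shifted upward when status=1
    "status": status,
})
features = df.drop(columns="status")

# 1. Class balance: share of each target value.
class_share = df["status"].value_counts(normalize=True)

# 2. Asymmetry: sample skewness per feature (about 0 for a symmetric distribution).
skewness = features.skew()

# 3. Relationship with the target: Pearson correlation of each feature with status.
target_corr = features.corrwith(df["status"])

print(class_share)
print(skewness)
print(target_corr.sort_values(ascending=False))
```

On real data, the same three calls give a class ratio, a per-feature skewness value, and a ranked list of correlations, which makes statements such as "most features are skewed" easy to verify.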

Build Model
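
As a rough sketch of how the classifiers named in this section could be trained and compared with scikit-learn, the snippet below fits all six on the same split. It rests on several assumptions: the data is synthetic (generated with `make_classification`, with an imbalanced target), the hyperparameters are library defaults, and the 80/20 stratified split and the scaling step for the distance- and margin-based models are illustrative choices, not necessarily the ones used in this notebook.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic, imbalanced stand-in data (roughly 75% of samples in class 1).
X, y = make_classification(
    n_samples=300, n_features=10, n_informative=5,
    weights=[0.25, 0.75], random_state=0,
)

# Stratify so that both splits keep the same class ratio.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# KNN, logistic regression, and SVM are sensitive to feature scale,
# so they are wrapped in a pipeline that standardizes the inputs first.
models = {
    "K Nearest Neighbors": make_pipeline(StandardScaler(), KNeighborsClassifier()),
    "Logistic Regression": make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000)),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
    "Naive Bayes": GaussianNB(),
    "Support Vector Machine": make_pipeline(StandardScaler(), SVC()),
    "Random Forest Classifier": RandomForestClassifier(random_state=0),
}

# Fit each model on the training split and score it on the held-out split.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = accuracy_score(y_test, model.predict(X_test))

for name, score in scores.items():
    print(f"{name}: {score:.3f}")
```

Because the target is imbalanced, accuracy alone can look optimistic. Metrics such as precision, recall, or F1 (for example via `sklearn.metrics.classification_report`) give a fuller picture.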

K Nearest Neighbors

Logistic Regression

Decision Tree

Naive Bayes

Support Vector Machine

Random Forest Classifier

Store and Load the Model
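
One common way to store and reload a fitted scikit-learn model is `joblib`. The sketch below assumes that approach, with `pickle` being a similar alternative. The model, the synthetic training data, and the file name `model.joblib` are all hypothetical stand-ins.

```python
import os
import tempfile

import joblib
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Fit a small model on synthetic data so there is something to persist.
X, y = make_classification(n_samples=100, n_features=8, random_state=0)
model = RandomForestClassifier(n_estimators=20, random_state=0).fit(X, y)

# A temporary directory keeps the example self-contained; use a real path in practice.
with tempfile.TemporaryDirectory() as tmp_dir:
    model_path = os.path.join(tmp_dir, "model.joblib")  # hypothetical file name
    joblib.dump(model, model_path)      # store the fitted model on disk
    restored = joblib.load(model_path)  # load it back into memory

# The restored model should reproduce the original model's predictions.
same_predictions = np.array_equal(model.predict(X), restored.predict(X))
print(same_predictions)
```

Files written this way rely on pickle under the hood, so they should only be loaded from trusted sources, ideally with the same scikit-learn version that produced them.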